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ABSTRACT 

The jackknife and bootstrap methods are becoming more 
popular in research. Although the two approaches have similar goals 
and employ similar strategies, information is lacking with regard to 
the comparability of their results. This study systematically 
investigated the issue for a canonical correlation analysis, using 
data from four random samples from the National Education 
Longitudinal Study of 1988. Some conspicuous discrepancies are 
observed mainly under small sample size conditions, and this raises 
some concern when researchers need to choose between the two for 
their small samples. Due to the lack of theoretical sampling 
distributions in canonical analysis, it is unclear which method had 
superior performance. It is suggested that Monte Carlo simulation is 
needed for this kind of comparison. It is also suggested that caution 
is warranted in generalizing the results to other statistical 
techniques, since the validity of such generalizations is uncertain. 
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Abstract 

The jackknife and bootstrap methods are becoming more 
popular in research. Although the two approaches have similar 
goals and employ similar strategies, information is lacking with 
regard to the comparability of their results. This study 
systematically investigated the issue for a canonical correlation 
analysis. Some conspicuous discrepancies are observed mainly 
under small sample size conditions, and this raises some concern 
when researchers need to choose between the two for their small 
samples. Due to the lack of theoretical sampling distributions 
in canonical analysis, it is unclear which method had superior 
performance. It is suggested that Monte Carlo simulation is 
needed for this kind of comparison. It is also suggested that 
caution is warranted in generalizing the results to other 
statistical techniques, since the validity of such generalization 
is uncertain. 
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Background 

In educational and psychological research, the over-reliance 
on statistical significance testing has been questioned on 
several grounds. Issues have been raised which are related to 
the function of sample size, the validity of theoretical 
assumptions, and the meaning and interpretation of statistical 
significance (Carver, 1978; Shaver, 1993; Thompson, 1993). 
Partially due to these concerns, some alternative approaches with 
less emphasis on theoretical assumptions and more on empirical 
estimation become increasingly popular. 

The best-known empirical procedures which rely on data 
resampling are the jackknife and bootstrap methods. These two 
methods have been used either as nonparametric approaches for 
constructing empirical sampling distributions (Efron, 1981, 
1982) , or as procedures for investigating the invariance or 
stability of sample results (Thompson, 1993) . Although both 
procedures rely on data resampling, and are considered similar in 
their goals, the similarities and potential differences between 
the two approaches have not been systematically investigated. 
Thus it is yet unclear to what extent the results froA these two 
procedures will be comparable- 

Conceptually, jackknife and bootstrap techniques share many 
things in common. The major thrust of these two techniques lies 
in the fact that, instead of relying upon the theoretical 
assumptions to derive sampling distributions for statistical 
estimators, and instead of letting sample size drive statistical 
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decisions, these two approaches empirically estimate sampling 
distributions for a given sample, using the information contained 
within the sample of observations at hand (Diaconis & Efron, 
1983; Efron, 1979; Quenouille, 1949; Tukey, 1958). Research 
decisions, be it statistical decisions such as significance 
testing, or substantive decisions such as evaluating the 
invariance or stability of sample results, will be made in 
relation to the empirically-estimated sampling distributions or 
variability. 

Though inherently similar in their goal and in their general 
strategy in accomplishing the goal, the jackknife and bootstrap 
methods employ tactically different data resampling techniques. 
The following two sections briefly review the two approaches. 
The Jackknife 

Based on the earlier work of Quenouille (1949) , Tukey 
(1958) refined the jackknife technique and introduced the 
estimate for standard error. Crask and Perreault (1977) defined 
the purpose of the jackknife approach as partitioning out "the 
impact of effect of a particular subset of data on an estimate 
derived from the total sample" (p. 61). In the jackknife 
analysis, the N observations in the sample are divided into K 
equal subsets. One analysis is conducted and the estimator is 
obtained after deleting each subset, and K analyses will be 
conducted altogether. In usual practice, each deleted subset 
consists of one observation only, thus K = N. In other words, 
one observation is deleted for each analysis. This, however, is 
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not a theoretical necessity for implementing the jackknife 
analysis, and each subset can consist of any number of 
observations, as long as the v;hole dataset is equally divided 
into K subsets. 

For estimating the mean and standard error in the jackknife 
analysis, let 8(i,= 0( Xi+i, . . .Xr) be the value of the 

statistic (any statistic of interest) obtained when Xi (Xi, one 
subset out of the K equal subsets from the original sample) is 
deleted from the data set, and let 8 = (1/K) £ 0{i, . Then the 
mean 0 is the estimate for the parameter 0, and the standard 
error for the estimator 0 is obtained through the following 
formula (Efron & Gong, 1983): 



6e = J^Ef.(V^)^ 



In some sense, the jackknife is not a pure data resampling 
technique, and it does involve certain amount of algebraic 
manipulation. With regard to data resampling, the number of 
analyses in the jackknife is limited by the sample size, since 
only K (number of subsets) analyses are possible. This condition 
poses certain limitation on the jackknife as a data resampling 
technique, and such limitation could be especially conspicuous 
for small samples. Furthermore, it is unclear whether, for a 
given sample, the size for each of the K subsets will cause any 
systematic differences in results. In other words, does it 
matter if one observation is deleted for each jackknife analysis 
compared to five observations deleted each time? 
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The Bootstrap 

Pioneered by Efron (1979) as another empirical approach for 
estimating sampling distribution for almost any estimator, the 
bootstrap has a more liberal resampling strategy. From the 
original sample of size N, repeated sampling is carried out for 
sample size of n, using sampling with replacement . In usual 
practice, n = N, that is, every bootstrap sample has the same 
number of observations as in the original sample. For every 
resampling, due to sampling with replacement, some observations 
in original sample may be missing, and some others may be sampled 
more than once. Again, n = N is not a theoretical necessity, and 
n can be any number smaller than N. For each bootstrap sample of 
size n, analysis is conducted and the estimator is obtained. 
Through repeated resampling, the distribution for the estimator 
is empirically established. Unlike the jackknife approach, the 
number of analyses in the bootstrap is unlimited. To a large 
degree, the bootstrap resembles classical Monte Carlo simulation, 
except that the sampling frame is limited to the original sample 
rather than from a population universe. 

As data resampling techniques, the jackknife and bootstrap 
procedures may be useful in three situations. First, for 
parametric statistics with theoretical assumptions, when the 
validity of the assumptions for the given data is either in 
question or difficult to satisfy, the jackknife or the bootstrap 
can be employed as alternatives to the parametric approach. In 
doing so, researchers rely on empirical estimation rather than on 
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questionable theoretical assumptions for the given data. 

The second situation in which the jackknife or the bootstrap 
may be more useful is when no theoretical sampling distributions 
exist. Such is the case for some multivariate techniques 
(Johnson & Wichern, 1982) . For example, in exploratory factor 
analysis, no theoretical sampling distribution is available for 
factor pattern coefficients. Also, in discriminant analysis and 
canonical correlation analysis, no theoretical sampling 
distributions exist for either function coefficients, structure 
coefficients, or canonical correlation coefficients. In these 
situations, the inferential aspect of these analyses becomes very 
problematic or even impossible. If researchers have any desire 
to evaluate these statistics, empirical approaches such as the 
jackknife, the bootstrap, or the classical Monte Carlo 
simulation, become the only viable course of action. 

The third situation where these data-resampling techniques 
may benefit researchers is to estimate the extent to which 
research results are likely to replicate for other similar 
samples (Thompson, 1993) . Although the ultimate test for 
research results replicability is to conduct actual replication 
studies, practically this is often impossible. By employing 
data-resampling methods, estimates for such replicability can be 
obtained. Although such estimates are likely to be inflated due 
to the fact that resampling is carried out on a single sample, it 
is still better than no estimate at all (Thompson, 1993) . 

Due to the different resampling strategies employed in the 
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jackknife and the bootstrap, potentially, the results from these 
two techniques may differ to certai degree. Given the close 
association of the two techniques, one would think that the 
comparability between the two approaches should have been 
adequately addressed. Surprisingly, this is not the case. There 
has been a lack of empirical information about the comparability 
of results from the two approaches, in spite of some occasional 
attempts (e.g., Efron & Gong, 1983; Dalgleish, 1992). 

Quantitative researchers are constantly in situations where 
choices need to be made among different methods, and the choice 
between the jackknife and the bootstrap is just one example. If 
we want to make judicious and well-grounded choice, we need 
better understanding about the similarity and potential 
difference between the two approaches. Unless empirical evidence 
about the issue accumulates, our choice between the two 
approaches will likely be a haphazard one. 

It is the purpose of this paper to investigate 
systematically the similarities and potential differences between 
the jackknife and the bootstrap, two popular data-resampling 
techniques. Canonical correlation analysis is chosen for this 
investigation. The choice of canonical correlation analysis for 
this investigation reflects two considerations: (1) many 
statistics in canonical correlation analysis lack theoretical 
sampling distributions, so potentially, the jackknife and the 
bootstrap can be more useful for such analysis; (2) canonical 
analysis is a very general statistical model which subsumes many 
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other statistical techniques as its special cases (Fan, 1992; 
Knapp, 1978; Thompson, 1991). 

Methods 

The data used in this study are four random samples drawn 
from the National Education Longitudinal Study of 1988 (NELS:88) , 
a large-scale database of a nationally representative sample. 
The survey study and data collection were designed and conducted 
by the National Center for Education Statistics (NCES) , U.S. 
Department of Education. Detailed information and description 
for the datasets within this database are available elsewhere 
(NCES, 1994). 

Six variables were used in the study: test scores of 
reading, math and science obtained in 1988 (predictor variable 
set in canonical correlation analysis) and the same three test 
scores obtained four years later in 1992 (criterion variable set 
in the canonical correlation analysis) . For each subject area, 
the test scores obtained four years apart were equated to be on 
the same measurement scale through Item Response Theory equating 
(NCES, 1994) . Psychometrically, this canonical correlation 
analysis between the two variable sets could be conceptualized as 
resembling a multivariate predictive validity study. 

To evaluate the comparability of results from the jackknife 
and the bootstrap methods, two factors were considered to have 
potential impact on the outcomes: 1) the size of the original 
sample to which the two techniques would be applied (Does certain 
sample size condition produce more comparable results between the 
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two approaches than some others?) ; 2) the number of observations 
dropped in each jackknife analysis (Does the deletion of one 
observation each time or five observations each time produce more 
comparable jackknife results with those from the bootstrap?) . 
Based on these two considerations, the following jackknife and 
bootstrap experiments were designed and implemented . 
The Jackknife Experiments 

Four random samples of different sizes were drawn from the 
database (N = 200, N = 100, N = 50, and N = 20). By varying 
jackknife deletion for each sample size condition, a total of 15 
jackknife experiments were conducted, as represented in Table 1. 
It is important to keep in mind that the maximum number of 
jackknife analyses for a given sample of size N is limited to be 
N/lWatodf with n<toiotod being the number of observations deleted for 
each jackknife analysis. As the number of observations deleted 
for each jackknife analysis increases, the maximum number of 
jackknife analyses for the given sample proportionately 
decreases. One cell in Table 1 was left empty, since only two 
jackknife analyses were possible for this cell condition, and 
this was judged to be too small. 



Insert Table 1 about here 



This design for jackknife experiments allows systematic 
examination of the impact of both the sample size condition and 
the jackknife deletion strategy variations on the comparability 
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of results from the jackknife and bootstrap approaches. In all 
these jackknife experiments, canonical analyses were conducted 
and the jackknifed mean and standard error for the statistics of 
interest were obtained. 
The Bootstrap Experiments 

Four bootstrap experiments were conducted, each for one of 
the four samples previously used in the jackknife experiments. 
Unlike the jackknife, the sample size imposes no constraints on 
bootstrap resampling. From each of the four original samples 
(N=200, N=100, N=50, and N=20) , one thousand bootstrap samples 
were drawn. Canonical correlation analysis was conducted for 
each bootstrap sample, and the bootstrapped mean and standard 
error for the statistics of interest were obtained. These 
bootstraps id estimators and their standard errors were then 
compared with those from the jackknife experiments to provide an 
empirical estimate for the discrepancy of results from the two 
approaches . 

All data sampling, jackknife and bootstrap resampling for 
the experiments, and all calculations for canonical correlation 
analysis were implemented using the Interactive Matrix Language 
under SAS (PROC IML) (SAS Institute Inc., 1989). The following 
estimators were obtained from the jackknife and the bootstrap 
experiments: canonical correlation coefficients (Canonical Rs) , 
canonical structure coefficients (SC) , and canonical function 
coefficients (FC) . Comparisons were made for these statistics 
obtained from the two approaches. 
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Results and Discussions 

Due to space limitations, only part of analysis results from 
the bootstrap and the jackknife experiments are presented and 
discussed in detail. Specifically, the following statistics were 
discussed: three canonical correlation coefficients, the function 
coefficients for the predictor variable set (1988 test scores) on 
the first canonical function, and the structure coefficients for 
the criterion variable set (1992 test scores^ with its own first 
canonical function. 

Tables 2, 3, and 4 present the differences between the 
bootstrap and the jackknife estimates on canonical Rs, structure 
coefficients, and function coefficients, respectively. Generally 
speaking, in the application of data resampling methods, standard 
error estimation, i.e., estimation for the variability of a 
statistic, rather than estimation for the statistic itself, is 
often the focus of inquiry. In Tables 2, 3, and 4, both 
differences on the statistic of interest and those on the 
estimated standard errors were presented. Also, both the mean of 
differences and the mean of absolute differences were presented. 
Difference between estimates from the two approaches was 
constructed as: 

Difference = ] Estimatej,cwcij.fe| - 1 Estimateboot«trapl 
So, relative to the bootstrapped value, positive difference 
indicates a larger jackknifed value, and negative difference a 
smaller jackknifed value. 

The mean of differences and the mean of absolute differences 
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convey different meanings. The former provides an indication of 
whether jackknife tends to provide larger or smaller values 
compared to those from the bootstrap. The mean of absolute 
differences, on the other hand, provides an indication about the 
degree of absolute differences in results between the two 
approaches, regardless of the direction of difference, because 
positive and negative differences do not cancel each other out in 
the calculation for the mean of absolute differences. 

Insert Table 2 about here 
Insert Table 3 about here 
Insert Table 4 about here 



Although it may not be immediately obvious what, if any, 
tentative conclusions could be drawn from the data in these three 
tables, careful examination did reveal certain trends. The first 
question which could be tentatively answered is whether 
jackknifed estimates have any tendency of being systematically 
larger or smaller in value than bootstrapped estimates. For this 
question, the mean of differences could be useful. Although in 
Table 2 (canonical Rs) , the jackknife estimates for standard 
errors all exceeded those from bootstrap (all positive values) , 
the reverse was observed from Tables 3 and 4 (structure 
coefficients and function coefficients) . So overall no 
consistent pattern was observed. This indicates a lack of 
tendency as to whether the jackknife would provide larger or 
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smaller estimates for the statistics and their standard errors. 

The jackknife experiments were systematically varied on two 
dimensions: sariple size condition and jackknife deletion 
strategy. The three tables (Tables 2, 3, and 4) seem to provide 
clearer indication about the impact of these two variables on the 
differences between the bootstrap and the jackknife results. To 
evaluate the impact of these two variables, the mean of absolute 
differences in the tables is useful. In the three tables, larger 
differences tend to appear under smaller sample size conditions. 
For example, in Table 2, the mean of absolute differences on 
canonical Rs increases almost ten-fold (from .006 to .055, see 
the last row of the upper half of the table) from sample size of 
200 to sample size of 20. Change of similar magnitude in the 
same direction is observed fdr the mean of absolute differences 
on standard error estimation of the canonical Rs (.005 to .043, 
see the last row of the lower half of the table) . The variation 
of jackknife deletion strategy, on the other hand, does not seem 
to have systematic impact on the differences between the two 
approaches. Almost identical trends were observed for structure 
coefficients (Table 3) and function coefficients (Table 4) . 

To verify the observed trends discussed above, analysis of 
variance (ANOVA) was conducted to test the impact of sample size 
condition and jackknife deletion variation on the differences 
between the jackknife and the bootstrap results. Table 5 
presents the ANOVA results for absolute differences on canonical 
Rs, structure coefficients, and function coefficients. Both 
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differences on the statistic of interest and those on the 
estimated standard errors were used as dependent variables in the 
analyses. 



Insert Table 5 about here 



Table 5 indicates that the degree of discrepancy between the 
results of the bootstrap and jackknife experiments is mainly a 
function of sample size condition. Sample size is consistently 
tested to be statistically significant with large effect sizes 
(R-Squares ranging from 0.42 to 0.72), confirming the previous 
observation that the smaller the sample size, the larger 
difference the results from the two approaches tend to have. The 
jackknife deletion strategy variation, on the other hand, turns 
out to have no consistent impact on the observed differences. In 
other words, whether one, five or ten observations are deleted at 
a time for jackknife analysis, such variation does not 
systematically influence the differences of results between the 
jackknife and bootstrap approaches. The interaction effect 
between sample size condition and jacKknife deletion strategy is 
also negligible. These results are consistent for both the 
different statistics (canonical Rs, structure and function 
coefficients), and for the standard error^i of these statistics. 

What kind of discrepancy could we expect from the two data 
resampling techniques? Table 6 attempts to provide a tentative 
answer to this question. As discussed previously, in most 
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applications of datd resampling techniques, estimation of 
standard error or variability of a statistic is often the major 
concern. Table 6 summarizes differences in standard error 
estimation between the two approaches. Each entry in the table 
is the ratio of mean of absolute standard error differences 
between the jackknife and bootstrap experiments to the 
bootstrapped standard error, and the ratio is expressed in 
percentages. These entries are most easily understood as 
follows: to construct a confidence interval for a statistic, how 
different such confidence interval widths could be based on the 
jackknife and bootstrap methods? Large values in the table 
indicate large differences in confidence interval width. An 
entry of 10/100 simply means that the jackknifed confidence 
interval is approximately 10% wider or narrower than the 
bootstrapped confidence interval. 

In Table 6, for sample size condition of 200, the average 
confidence interval width difference is about 7% across the three 
types of statistics: canonical Rs, structure coefficients, and 
function coefficients. In other words, the jackknifed confidence 
interval is about 7% wider or narrower than bootstrapped 
confidence interval (see the last row in the table) . But for the 
sample size condition of 20, the difference in confidence 
interval width is about 40%, a substantial difference. Put in 
other words, under this sample size condition, jackknifed 
confidence interval is about 40% wider or narrower than 
bootstrapped confidence interval. This indicates that the 
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estimated sampling variability based on the two approaches can 
have substantial difference for small sample size conditions. 



This study empirically assessed the comparability of results 
obtained from the jackknife and the bootstrap approaches, and a 
case of canonical correlation analysis was used in the 
investigation. The investigation is systematic in the sense that 
both different sample size conditions and different jackknife 
deletion strategies were considered, and the impact of these 
variations on the results assessed. 

The results from the study indicate that the disparity 
between jackknife and bootstrap results is primarily affected by 
the size of a sample to which the two techniques are applied. 
When sample size is large, the difference from the two approaches 
is small, or even negligible. For small samples, the estimated 
variability for a statistic from the two approaches, expressed in 
terms of confidence interval width, can be substantially 
different. This finding seems to be in line with some previous 
observations. Efron and Gong (198 3) indicated that, compared 
with bootstrap approach, the estimated standard error from the 
jackknife approach exhibited more variation in their experiments. 

The variation of the jackknife deletion strategy, that is, 
how many observations to drop for each jackknife analysis, does 
not seem to have any systematic impact on the jackknife results. 
This finding holds for the different types of statistics and 
their standard errors investigated in this study. 



Conclusions 
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The finding that substantial difference may exist between 
results from the jackknife and the bootstrap has some practical 
implications. Data resampling methods tend to be utilized more 
in situations where sample size is relatively small (Oxaconis & 
Efron, 1980) . For small samples, the potential large difference 
in results from the two approaches will be a legitimate concern. 
Researchers interested in using these methods should have the 
sense about which method performs better in what situations. 

Because the statistics investigated in this study do not 
have theoretical sampling distributions, it is not possible to 
make statements about which approach produced estimates closer to 
the "true' values, or which approach performed better. The best 
estiiiiciLion of sampling distributions for these statistics can be 
obtained from Monte Carlo simulation which samples from a 
population universe. In order to judge which of these two data 
resampling techniques performs better, probably they will have to 
be compared with Monte Carlo simulation results. 

The study investigated a case of canonical correlation 
analysis. It is not known how generalizable these findings will 
be for other statistical techniques. Whatever tentative 
conclusions drawn in this study are thus limited to canonicc 1 
correlation analysis. Without further empirical studies, caution 
is warranted in making generalizations beyond what has been 
studied here. 
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Table 1 

Jackknife Experiments Varied on Sample Size and Deletion Strategy 



Sample Size 



Deletion 
Strategy 


200 


100 


50 


20 


Delete 1 


200* 


100 


50 


20 


Delete 2 


100 


50 


25 


10 


Delete 5 


40 


20 


10 


4 


Delete 10 


20 


10 


5 


_b 


a the maximum 


number 


of jackknife 


analyses possible 


for each 



cell condition 

b No jackknife experiment for this cell condition, since the 
possible number of jackknife analyses (2) is judged to be too 
small . 
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Table 2 

Jakcknife/Bootstrap Differences; Canonical Rs 



For Estimates of Canonical Rs 



Sample Size 



Deletion 










Across 


Strategy 


200 


100 


50 


20 


Column" 


Delete 1 


000* 


000 


"006 


-023 


-007 




1 006| 


^ |013l 


1 020 1 


1 070 1 


1 027 1 


Delete 2 


000 


000 


-006 


-023 


-007 




1 006| 




1 n 1 

1 w ^ w 


1 nfiT 1 

1 VJ Q J 1 




Delete 5 


000 


003 


-006 


-033 


-009 




1 006 


1 m n 1 
1 u±u 1 


1 n 1 

1 w ^ w 1 


1 n*^ 1 
\j ^ ^ 


1 m 7 1 


Delete 10 


000 


003 


-003 


— — — 


000 




1 006 


I 010 1 


1 023 1 




1013 1 


Across 


000 


001 


-005 


-026 


-006^ 


Row'' 


1 006 


|011| 


1 020 1 


1 055 1 


1 02ll 


For Estimates of 


standard 


Errors 


of Canonical Rs 




Delete 1 


003 


008 


017 


034 


015 




1 003 


10131 


10171 


10341 


|015l 


Delete 2 


004 


012 


025 


013 


014 




1 004 


1 |012l 


1 025| 


1024 1 


|016| 


Delete 5 


007 


020 


038 


009 


019 




1 007 


1 |020| 


1 038 1 


1 072 1 


10341 


Delete 10 


005 


014 


015 




Cll 




1 005 


1 10141 


|015l 




1 oiil 


Across 


005 


013 


024 


019 


015 


Row 


1 005 


1 1013| 


1024| 


|043| 


|020l 
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a: mean of differences across canonical Rs. All entries 

presented with decimal point omitted at the third decimal 
place. 

b: mean of absolute differences across canonical Rs. 
c: averaged across sample size conditions for each deletion 
variation. 

d: averaged across the jackknife deletion variations for each 

sample size condition, 
e: averaged across both sample size conditions and deletion 

variations. 
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Table 3 

Jakcknife/Boots trap Differences; Structure Coefficients '^ 



For Estimates of Structure Coefficients 



Sample Size 



Deletion 










Across 




Strategy 


200 


100 


50 


20 


Column*^ 




Delete 1 


006^ 


000 


023 


050 


020 




1 


006 1^= 


1 000 1 


1 023 1 


t A i— A t 

1 050 1 


1 020 1 




Delete 2 


006 


000 


023 


050 


020 




1 

1 


006 1 


1 000 1 


1 023 1 
I I 


1 050 1 


1 020 1 




Delete 5 


006 


000 


020 


030 


014 




1 

i 


006 1 


1 000 1 


1 020 1 
1 1 


1 030 1 
I I 


1 014 1 




Delete 10 


003 


000 


016 




006 




1 

1 


003 1 


1 000 1 


1 016 1 




1 006 1 
1 1 




Across 


005 


000 


020 


043 


015^ 




KOW 


005] 


1 000 1 


1 020 1 


1043 1 


10151 




For Estimates 


of Standard 


Errors 


of Structure 


Coefficients 




Delete 1 


000 


000 


-005 


-116 


-030 






000 1 


1 000 1 


1 007 1 


|1161 


1 03X1 




Delete 2 


000 


000 


-024 


-087 


-027 






001| 


i 003 I 


|024l 


1 087 1 


1 028 1 




Delete 5 


003 


001 


-047 


-014 


-014 






003 1 


1 002 1 


1047 1 


1 036 1 


1 022 1 




Delete 10 


004 


000 


016 




-005 






004 1 


1 004 1 


|016| 




loiol 




Across 


001 


000 


-024 


-072 


-020 




Row 


002 1 


1 002 1 


1 025| 


1 079 1 


|024l 
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a: The table presents data for the structure coefficients of 
the criterion variable set on its own 1st canonical 
function.. All entries are presented with the decimal point 
omitted at the third decimal place. 

b: mean of differences averaged across the structure 

coefficients on the three variables in the predictor 
variable set. 

c: mean of absolute differences across the structure 
coefficients. 

d: averaged across sample size conditions for each deletion 
variation. 

e: averaged across deletion variations for each sample size 
condition. 

f: averaged across both sample size conditions and deletion 
variations . 



Q 2b 
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Table 4 

Jakcknife/Bootstrap Differences; Function Coefficients' 



For Estimates of Function Coefficients 







Sample 


Size 






Deletion 










Across 


Strategy 


200 


100 


50 


20 


Column'* 


Delete 1 


003^ 


003 


000 


016 


005 


1 
1 


003 1 ° 


1 010 1 


1 020 1 


1 036 1 


1 m 7 1 


Delete 2 


003 


003 


000 


010 


004 


1 


003 1 


1 1 


1 020 1 


1 030 1 


|015| 


Delete 5 


003 


003 


000 


-003 


000 


1 


003 1 


|010| 


1 013 1 
I I 


1 030 1 
I I 


1 0 14 1 


Delete 10 


003 


000 


000 




001 


1 

1 


003 1 


1 006 1 


1 006 1 
I I 




1 005 1 


Across 


003 


002 


000 


007 


003^ 


Row* 1 


003 1 


1 009 1 


|015l 


|032 1 


|013| 


For Estiinates 


of Standard 


Errors 


of Function 


Coefficients 


Delete 1 


003 


005 


032 


-073 


-008 




003 1 


|005| 


1 032 1 


1 073 1 


1 028 1 


Delete 2 


002 


003 


-060 


-085 


-035 




002| 


1 007 1 


1 060| 


1 085 1 


1 039 1 


De] ete 5 


-009 


-017 


-008 


041 


001 




009 1 


1 017 1 


1 039 1 


|041| 


1 026 1 


Delete 10 


-003 


-020 


-042 




-022 




010 1 


1 022 1 


|042l 




|025l 


Across 


-001 


-007 


-019 


-039 


-015 


Row 


006 j 


1 013 1 


|043l 


1 1 


|030| 
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a: The table presents data for the function coefficients of the 

predictor variable set on its own 1st canonical function. 

All entires are presented with the decimal point omitted at 

the third decimal place, 
b: mean of differences across the function coefficients, 
c: mean of absolute differences across the function 

coefficients . 

d: averaged across sample size conditions for each deletion 
variation . 

e: averaged across deletion variations for each sample size 
condition. 

f: averaged across both sample size conditions and deletion 
variations. 
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Table 5 

Impact of Sample size and Jackknife Deletion Strategy on thp, 
Discrepancv bet ween Bootstrap and Jackknife Restults 



Absolute 

Difference on Factors p r2 



Rs* 




Sample size (SS) 


.0004 


.40 






Deletion Variation (DV) 


.7395 


. 02 






SS * DV 


.9234 


. 05 


SE^ for 


Rs 


Sample Size (SS) 


.0041 


. 28 






Deletion Variation (DV) 


.1298 


. 10 






SS * DV 


.7676 


. 08 


SC^ 




Sample size (SS) 


.0001 


. 72 






Deletion Variation (DV) 


.2446 


. 02 






SS * DV 


.4788 


. 04 


SE for 


sc 


Sample Size fSS) 


.0001 


.48 






Deletion Variation fDV^ 


8QRP 


n 1 
• yj X 






SS * DV 


.1083 


. 13 






Sample Size (SS) 


. 0001 


.57 






Deletion Variation (DV) 


.3434 . 


. 03 






SS * DV 


.8469 


.03 


SE for 


FC 


Sample size (SS) 


.0002 


.43 






Deletion Variation (DV) 


.7035 


. 02 






SS * DV 


.7912 


.07 


2ii 



Jackknife and Bootstrap 

a canonical correlation coefficients 

b standard error 

c structure coefficients 

d function coefficients 
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Table 6 

Standard Error Estimate Differences between Jakcknife and 
Bootstrap Approaches Expressed as Percentages 



Sample Size Conditions 



200 100 50 20 



SE* (Canonical Rs) 9/100^ 12/100 30/100 43/100 

SE (Sturcture 7/100 12/100 32/100 52/100 
Coefficients) 

SE (Function 6/100 11/100 18/100 25/100 
Coefficients) 



Across Rows° 7/100 12/100 27/100 40/100 



a Standard Error 

b Thio percentage expresses the average percentage difference 
between jackknifed standard error and bootstrapped standard 
error, and is obtained by: (mean of absolute standar error 
difference / bootstrapped standard error) x 100/100. 

c averaged across three types of statistics for the same sample 
size condition 



ERIC 



31 



